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Interdependence of Institution and Public Calls for Valid Lines of Communication 



By Timothy Welch 

“Feedback” is elementary. When talking to a 
second party, the first party looks for feedback signs 
of approval or disapproval. When talking to an audi- 
ence, a speaker looks for feedback indicators of the 
crowd reaction. 

If there are only two people whose feedback must 
be attended, sampling obviously isn’t necessary. 
When the universe becomes 20, or 200, or 200,000, 
however, feedback gets lost. Yet the reaction of 
these individuals, whether as voting taxpayers or 
consumers who vote through their purchases, is 
vital. 

But the “drop-in” vote is badly skewed. That is, 
telephone calls and letters received in anger can’t 
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be regarded as a “normal” sample. At best, some 
indication is being given about the flash point of 
certain members of the population : what it takes to 
ignite certain people to the point of complaining. 

A related objection can be made to the sampling 
of advisory committees set up by the college. Al- 
though the committees are invaluable, they are by 
nature specialized, and could constitute a good sam- 
ple only if they were accurately matched to the 
population strata. This, of course, isn’t the reason 
for advisory committees, and they are not so 
matched. 

The temptation is to give too much weight td these, j 
in a sense, “free samples.’ 
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All this is only to restate the obvious : that sam- 
lin^^ is a continual process, basic to existence, which 
oes on every waking hour. The choice is whether 
le sampling should be disciplined or haphazard. 
Jertainly for public information, or public relations, 
r community service, or community relations, 

, valid sampling program is basic. The voters — 
lolitical or economic — are out there, and their votes 
nust be counted. 

The needs for a simple sampling procedure are 
rarious. Periodic, general-impression polls, or 
'image” polls, with continuing questions, are valu- 
ible for long-run estimates of how the sampling 
igency is seen. Postcard polls can gauge opinion on 
fast-rising issues. A rotating mailing list for news- 
etters, drawn >as a sample, will achieve equal dis- 
■ribution among the voting population. 

Sampling procedures are necessarily statistical, 
md for a measure of any complexity should be 
Lurned over to professional sampling organizations. 
But for samples of a modest, day-to-day scope, it 
isn’t practicable to pay a sampling organization. 

The purpose of this article, then, is to suggest 
sampling methods for local agencies with limited 
budgets. 

The best definition of a school district’s population, 
it is a.ssumed, is the most current list of rercis- 
tcred voters. This is an opinion-forming group, and 
one which acts on its opinions. But even more to the 
point, it is the “employer group” whose opinion is 
crucial. For a public body dependent upon taxes, reg- 
istered voters are, for practical purposes, the true 
population. 

Note that a moot ethical point is couched here. 
Shouldn’t all taxpayers who contribute to the sup- 
port of a school district be considered? In a sense, 
anyone who pays any taxes for the support of a 
school is an “employer.” Further, shouldn’t all citi- 
zens over 21 year.s be included? 

Perhaps. But the objection can be answered by 
noting that, as a practical matter, a high correlation 
exists between voting registration and property-tax 
payers who primarily support public schools. As for 
the ethics of excluding nonvoters, it may be argued 
that they are obviously not so interested in making 
decisions. A democracy is the product of those who 
take part in its operation. In any population, it is the 
active decision makers whose opinions should be 



sought, for their opinions are likely to be dominant, 
informed, and directive. 

The most current available list of registered voters 
can be the frame, then, from which the sample is 
drawn. (A “frame” is simply a list of the sanriple 
units in the population.) Frequency of the list's 
changing should be noted to keep its information 
accurate. 

County clerks in many instances sell such lists of 
registered voters. In Santa Clara County, California, 
for example, lists of the county’s 300,000 registered 
voters may be purchased for 60 cents per thousand 
names. In Fresno County, California, lists cost 50 
cents per thousand names, broken into some 760 
precincts with between 175 to 200 names for each 
precinct. Some counties, such as Fresno, restrict the 
sale of this list to people using it for a political pur- 
pose, such as information about a bond issue. It may 
be successfully argued by a public taxing age icy 
that communication with its employers is always 
political. Purchase of the entire list can still be 
expensive, however. In this event, a sub-sample of 
the population (by precincts) can be drawn so that 
an entire voting list needn’t be purchased. 

To help keep the sample list current, the names 
should be replaced (for instance, “Local taxpayer,” 
or “Junior college taxpayer,” or “Friend of Blank 
College”) to lessen the problem of nonresponse occa- 
sioned by people moving. 

Choosing the Sample 

Given a voter registration list of 300,000 names, 
how is a sample drawn? 

How many need to be reached to comprise an ade- 
quate sample? A balance must be struck between 
squandering money on too large a sample, or getting 
too small a sample for any confidence. (It should be 
noted, by the way, that the accuracy of an estimate 
of a proportion does not depend upon the size of the 
population — only upon the size of the sample. ^ 
Stated one way : how high a probability of being 
accurate, within stated percentages, is desired? 

Stated another way : how precise a sampling, sep- 
arate from the question of your confidence in the 
sampling, is required? 

A guide is needed to allow the sampler to decide 
how much accuracy he wants to purchase for his 
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sample, and to define just what confidence he can 
place in a sample of a given size. 

Once systematic bias has been eliminated by a 
well-drawn random sample, ^ the question of sample 
size and confidence interval becomes mathematical. 
So the mathematics of sampling must be briefly 
touched upon. 

To determine the size of the sample to be drawn, 
we have to know two things : (1) How precise do we 
want to be? That is, how closely do we want our 
sample characteristics to match the real population 
characteristics? (2) And, how certain do we want 
to be that our sample falls within the above limits? 

If, for example, there were a 60-40 split in the 
party registration, suppose we wanted a .95 proba- 
bility that the error of estimate would not exceed 
.02 units. In ether words, that our sample would 
draw 58 to 62 per cent Democrats, and, conversely, 
42 to 38 per cent Republicans — and that we could 
be 95 per cent sure that our sample v/ould do this. 
(The other 5 per cent of the time, we could miss 
. badly, and draw heavy proportions from one party 
or the other, and misrepresent the political opinions 
of the population.) 

A simple formula gives our sample size, as soon as 
we make the policy decision of how much accuracy 
and how much certainty we want. The formula is: 

‘02pq 

Here *0 is the certainty figure (95 per cent) ,® p and q 
are the corresponding proportions of Democrats (60 
per cent) and Republicans (40 per cent) in the regis- 
tered voting list, and e is the accuracy figure.^ 

A further table yields the “certainty figures” to 
insert in the formula: 

95 per cent 1.96 

97.5 per cent 2.24 

99.8 per cent 3.09 

So, then, to be 95 per cent certain that we get in 
our sample ± 2 per cent of the actual number of 
Democrats and Republicans, the formula for the 
sample size is 

„ = (.02). = 2.294 in sample 

If we wanted to be 97.5 per cent certain that we 
were ± 2 per cent accurate : 

„ = (2-24)i^^.. ^^ (-60) (.40 ) ^ 3 

If we wanted to be 99.8 per cent certain that we 
were ± 2 per cent accurate . 

n = = 5.729 in sample 



It should be obvious from the above examples that 
the cost of the sample rises as we increase both 
certainty and accuracy.® 

The point should be made by now that two things 
may be “purchased” in a sample: precision or ac- 
curacy, and degree of certainty. The extent to which 
we desire each determines the size of the sample, 
and its cost.® 

A Typical Survey 

As an illustration, let us run a postcard survey 
which might be made to find out the voters’ feelings 
about a bond proposal to build a combination foot- 
ball-baseball stadium. 

Although party registration is a potent variable 
(along with age, sex, education level, income, re- 
ligion, race), let’s say that we’re interested in an- 
other variable: voting behavior in the last school 
election. 

Since turnout for these elections rarely reaches 
the figures of a national election, let’s say that ll per 
cent of the registered voters of the county, or school 
district, voted."^ How large a sample would we need to 
be 99.74 per cent certain that our sample proportion 
would represent ± 3 percent of the real 11 per cent 
of the voting list who actually voted? The sample 
size is given by 



n 



*0^pq (3)*^ (.11) (.39) _ qwq 

— ro3p 



which determines that 979 voters must be taken from 
the frame for this sample. 

An absolutely strict requirement is that they be 
chosen randomly : the validity of the entire operation 
rests on this point. How does one choose these 979 
voters randomly? 

A table of random numbers, available in most 
statistics books, is the easiest manner of choosing the 
979 voters. Assuming a frame of 760 precincts and 
150,000 registered voters, we go through the random 
number table in a systematic mianner for three-digit 
numbers from 001 to 760 until there are 979 precinct 
numbers listed. 

Then, with each precinct list (in which the randoni 
number of voters appears on the front) , we go down 
the random number table for the first three-digit 
number appearing within that interval. For ex- 
ample, if there are 182 voters in the 546th precinct 
and our precinct draw has told us that we mus 
have two voters from the 546th precinct, we look 
for the first two three-digit numbers between OOl 
and 182. 

These names and addresses are the result. 

When the 979 addresses have been assembled, th 
mailing list is complete. If one person is pull 
twice in the same sample, his response must be give 
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iuble weight. Random sampling demands that each 
jniber have an equal opportunity of being drawn 
each draw, and not eliminated from the list. 

The problem of nonresponse must be touched on 
ieily. It is almost certain, of the 979 postcards we 
iid, that there will not be 979 returned. 

From a statistical point of view, this must be 
indled carefully, for nonrespondents may be a spe- 
al type of person whose voting behavior may be 
lated to our study. It will not suffice to simply 
■aw additional names. That is, these nonrespond- 
ts are saying something to us by not responding, 
d this message is as important to us as that on 
e returned postcards, if we are to keep our sample 
andoiii and representational. 

1 In the example at hand, nonresponse would prob- 

[bly best be handled by setting up such a category. 

I would be simplest to treat the message of their 

lonresponse as indifference to the issue, which could 

bdicace that they are a part of the 89 per cent who 

lid not vote in the last school election and/or that 

bey don’t care enough about the issue to vote in an 

election which concerns it. 

* % 

I The response will almost certainly be greater than 
il per cent of the 979 sampled, indicating that more 
people will have opinions and reply than voted in the 
last election. 

I If greater rigor is desired, the problem of non- 
response may be met with telephone and field visit 
jollow-up. In this event, the sub-sample is weighted, 
md the results projected. For instance, if there are 
too nonrespondents, and a random sub-sample of 
Iwenty is contacted, their replies will be multiplied 
\y five. For such a sub-sample, a very simple sam- 
pling method, such as taking every fifth nonrespond- 
^nt, is adequate.® 



The Means of Asking 



! A further difficulty lies in the area of question- 
naire construction. It is assumed that the postcard 
furvey used here, or likely to be used, will be short 
jmd brief, with check boxes for simple, nonambigu- 
ws answers. If, however, a long form is sent, then 
he matter of good questionnaire construction should 
)e examined to obtain sharp, focused results.® 

The techniques of handling interviews is also a 
leld to itself, which requires detailed consideration 
0 avoid methodological problems for accurate data 
•eduction and analysis.^® Both questionnaire and 
hterview design are beyond the scope of this article. 

Statistics and sampling are the study of certainty, 
selling us, in an uncertain world, how sure we may 
36 of such-and-such. 

In the future, the interdependence of an institution 
ind its various publics can only grow, creating the 



need for valid lines of communication. Certainly it 
is sound psychology to ask people how they feel about 
something before doing it to them or for them. Be- 
cause of the large populations involved, disciplined 
sampling procedures are the necessary means of 
asking. 



1 Hoel, Paul G. Elementary Statistics. New York: 
John Wiley & Sons, Inc., 1960, p. 98. 

2 The two important points which enable the binomial 
distribution to be applied to practical sampling problems 
best define “random" : that each sample be independent of 
the other, and that the probability of any unit being 
chosen from the universe be exactly the same. See Hoel, 
p. 70. 

3 Hoel, op. cit., pp. 96-97. This equation is derived 
from the more familiar 

8p = pq or n = pq 
n 8p® 

* Note that although estimates of p and q as 60 per 
cent and 40 per cent are possible here (since we know the 
gross figures from the registered voting list), it would 
be possible to draw a sample on the assumption that both 
Democrats and Republicans appeared in equal numbers 
in the population. In other words, p = .50 and q = .50, 
which would reduce the sample size necessary to 1,225, 
since there would then be an equal chance of drawing 
either a Democrat or a Republican in a random draw. 
However, since we know from the voting list that there 
are not equal numbers of each party in the population, 
the sample size has to increase to be 95 per cent certain 
that we approximate the real 60-40 percentage split 
within the actual population. Drawing as though there 
were a 50-50 chance of drawing a Republican would result 
in a bias favoring Democrat representation in the sample. 

It costs more to increase the accuracy in the error of 
estimate in a sample than it does to increase the certainty. 
Taking 95 per cent certainty at ±4 per cent accuracy as 
a base, it would require a sample of 2,305 to get ±2 per 
cent precision at 95 per cent certainty, but would require 
only 706 to 97 per cent certainty at ±4 per cent accuracy. 

® Two assumptions are crucial to this method of esti- 
mating p in this formula: that a large sample is involved, 
and that the normal curve approximation to the binomial 
distribution is satisfactory. For justification of both as- 
sumptions, see Hoel, op. cit., pp. 77-78, 97. A good back- 
ground may be obtained from the first six chapters of 
Hoel. 

" Since district and county lines are not always cotermi- 
nous, a suitable frame must be constructed by purchasing 
appropriate precinct lists which approximate the school 
district. 

3 Yates, Frank. Sampling Methods for Censuses and 
Surveys. 3rd ed. New York: Hafner Publishing Com- 
pany, 1960, p. 108. 

® A good source is Selltiz et. al., Research Methods in 
Social Relations (rev. ed.). New York: Holt, Rinehart, 
and Winston, 1962, chapters 7 and 10. 

Maccoby, Eleanor E., and Maccoby, Nathan, in Lind- 
sey (ed.). Handbook of Social Psychology. Vol. I. Read- 
ing, Massachusetts: Addison-Wesley Publishing Com- 
pany, 1954, pp. 449-487. 
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